# High-accuracy transcription

Gigaam V2 Onnx
MIT
GigaAM v2 is an automatic speech recognition (ASR) model for Russian speech-to-text, available in both CTC and RNN-T variants.
Speech Recognition Other
istupakov
170
2
Whisper Small Turkish 0
Apache-2.0
Turkish speech recognition model fine-tuned from OpenAI Whisper-small
Speech Recognition Transformers Other
ysdede
14
1
Kotoba Whisper V2.2
Apache-2.0
Japanese automatic speech recognition model based on Whisper, with integrated speaker diarization and punctuation restoration
Speech Recognition Transformers Japanese
kotoba-tech
22.80k
47
Whisper Large V2 Ko
Apache-2.0
Korean automatic speech recognition (ASR) model fine-tuned from OpenAI Whisper-large-v2 that performs well on Korean datasets
Speech Recognition Transformers Korean
byoussef
94
22
Whisper Large V2 Hausa
Apache-2.0
This model is a fine-tuned version of OpenAI's Whisper Large-V2 for Hausa speech recognition tasks, trained on the Common Voice 11.0 dataset
Speech Recognition Transformers Other
DrishtiSharma
44
5
Stt Kr Conformer Transducer Large
A large Korean automatic speech recognition model based on the Conformer-Transducer architecture, trained on the KsponSpeech dataset for Korean speech transcription.
Speech Recognition Other
eesungkim
129
9
S2t Medium Librispeech Asr
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture
Speech Recognition Transformers English
facebook
1,086
9
Wav2vec2 Large Xls R 300m Assamese Cv8
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on Assamese datasets
Speech Recognition Transformers Other
infinitejoy
18
0
Wav2vec2 Base 10k Voxpopuli Ft Hr
A speech recognition model based on Facebook's Wav2Vec2 architecture, pretrained on the VoxPopuli corpus and fine-tuned on Croatian data
Speech Recognition Transformers Other
facebook
20
0
Wav2vec2 Base 10k Voxpopuli Ft Nl
A speech recognition model based on Facebook's Wav2Vec2 architecture, pretrained on 10K hours of unlabeled Dutch data from the VoxPopuli corpus and fine-tuned on Dutch transcription data.
Speech Recognition Transformers Other
facebook
28
0
Wav2vec2 Punjabi Stt
A Punjabi speech recognition model based on the Wav2Vec2 architecture that converts Punjabi speech into text.
Speech Recognition Transformers
addy88
17
1